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Abstract 

We present a new multi-dimensional data structure, which we call the skip quadtree (for point data in 
R 2 ) or the skip octree (for point data in R'', with constant d > 2). Our data structure combines the best 
features of two well-known data structures, in that it has the well-defined "box"-shaped regions of region 
quadtrees and the logarithmic-height search and update hierarchical structure of skip lists. Indeed, the 
bottom level of our structure is exactly a region quadtree (or octree for higher dimensional data). We 
describe efficient algorithms for inserting and deleting points in a skip quadtree, as well as fast methods 
for performing point location and approximate range queries. 

1 Introduction 

Data structures for multidimensional point data are of significant interest in the computational geometry, 
computer graphics, and scientific data visualization literatures. They allow point data to be stored and 
searched efficiently, for example to perform range queries to report (possibly approximately) the points that 
are contained in a given query region. We are interested in this paper in data structures for multidimensional 
point sets that are dynamic, in that they allow for fast point insertion and deletion, as well as efficient, in that 
they use linear space and allow for fast query times. 

Related Previous Work. Linear-space multidimensional data structures typically are defined by hierar- 
chical subdivisions of space, which give rise to tree-based search structures. That is, a hierarchy is defined 
by associating with each node v in a tree T a region R(v) in K d such that the children of v are associated 
with subregions of R(y) defined by some kind of "cutting" action on R(v). Examples include: 

• quadtrees [29] : regions are defined by squares in the plane, which are subdivided into four equal-sized 
squares for any regions containing more than a single point. So each internal node in the underly- 
ing tree has four children and regions have optimal aspect ratios (which is useful for many types of 
queries). Unfortunately, the tree can have arbitrary depth, independent even of the number of input 
points. Even so, point insertion and deletion is fairly simple. 

• octrees [19,29]: regions are defined by hypercubes in R rf , which are subdivided into 2 d equal-sized hy- 
percubes for any regions containing more than a single point. So each internal node in the underlying 
tree has 2 d children and, like quadtrees, regions have optimal aspect ratios and point insertion/deletion 
is simple, but the tree can have arbitrary depth. 
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• k-d trees [8]: regions are defined by hyperrectangles in R d , which are subdivided into two hyper- 
rectangles using an axis-perpendicular cutting hyperplane through the median point, for any regions 
containing more than two points. So the underlying tree is binary and has [logn] depth. Unfortu- 
nately, the regions can have arbitrarily large aspect ratios, which can adversely affect the efficiencies 
of some queries. In addition, maintaining an efficient k-d tree subject to point insertions and removal 
is non-trivial. 

• compressed quad/octrees [2,9-11]: regions are defined in the same way as in a quadtree or octree 
(depending on the dimensionality), but paths in the tree consisting of nodes with only one non-empty 
child are compressed to single edges. This compression allows regions to still be hypercubes (with 
optimal aspect ratio), but it changes the subdivision process from a four-way cut to a reduction to 
at most four disjoint hypercubes inside the region. It also forces the height of the (compressed) 
quad/octree to be at most 0(n). This height bound is still not very efficient, of course. 

• balanced box decomposition (BBD) trees [4-6]: regions are defined by hypercubes with smaller hy- 
percubes subtracted away, so that the height of the decomposition tree is 0(log«). These regions have 
good aspect ratios, that is, they are "fat" [17, 18], but they are not convex, which limits some of the 
applications of this structure. In addition, making this structure dynamic appears non-trivial. 

• balanced aspect-ratio (BAR) trees [14, 16]: regions are defined by convex polytopes of bounded aspect 
ratio, which are subdivided by hyperplanes perpendicular to one of a set of 2d "spread-out" vectors so 
that the height of the decomposition tree is 0(logn). This structure has the advantage of having convex 
regions and logarithmic depth, but the regions are no longer hyperrectangles (or even hyperrectangles 
with hyperrectangular "holes"). In addition, making this structure dynamic appears non-trivial. 

This summary is, of course, not a complete review of existing work on space partitioning data structures 
for multidimensional point sets. The reader interested in further study of these topics is encouraged to read 
the book chapters by Asano et al. [7], Samet [33-35], Lee [21], Aluru [1], Naylor [26], Nievergelt and 
Widmayer [28], Leutenegger and Lopez [22], Duncan and Goodrich [15], and Arya and Mount [3], as well 
as the books by de Berg et al. [12] and Samet [31, 32]. 

Our Results. In this paper we present a dynamic data structure for multidimensional data, which we call 
the skip quadtree (for point data in R 2 ) or the skip octree (for point data in R rf , for fixed d > 2). For the 
sake of simplicity, however, we will often use the term "quadtree" to refer to both the two- and multi- 
dimensional structures. This structure provides a hierarchical view of a quadtree in a fashion reminiscent 
of the way the skip-list data structure [24, 30] provides a hierarchical view of a linked list. Our approach 
differs fundamentally from previous techniques for applying skip-list hierarchies to multidimensional point 
data [23,27] or interval data [20], however, in that the bottom-level structure in our hierarchy is not a list — 
it is a tree. Indeed, the bottom-level structure in our hierarchy is just a compressed quadtree [2,9-11]. 
Thus, any operation that can be performed with a quadtree can be performed with a skip quadtree. More 
interestingly, however, we show that point location and approximate range queries can be performed in a 
skip quadtree in O(logra) and 0(e l ~ d logn + k) time, respectively, where k is the size of the output in the 
approximate range query case, for constant e > 0. We also show that point insertion and deletion can be 
performed in 0(log«) time. We describe both randomized and deterministic versions of our data structure, 
with the above time bounds being expected bounds for the randomized version and worst-case bounds for 
the deterministic version. 

Due to the balanced aspect ratio of their cells, quadtrees have many geometric applications including 
range searching, proximity problems, construction of well separated pair decompositions, and quality trian- 
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gulation. However, due to their potentially high depth, maintaining quadtrees directly can be expensive. Our 
skip quadtree data structure provides the benefits of quadtrees together with fast update and query times even 
in the presence of deep tree branches, and is, to our knowledge, the first balanced aspect ratio subdivision 
with such efficient update and query times. We believe that this data structure will be useful for many of 
the same applications as quadtrees. In this paper we demonstrate the skip quadtree's benefits for two simple 
types of queries: point location within the quadtree itself, and approximate range searching. 

2 Preliminaries 

In this section we discuss some preliminary conventions we use in this paper. 

Notational Conventions. Throughout this paper we use Q for a quadtree and p, q, r for squares or quarters 
of squares associated with the nodes of Q. We use S to denote a set of points and x,y,z for points in R d . 
We let p(x) denote the smallest square in Q that covers the location of some point x, regardless if x is in 
the underlying point set for Q or not. Constant d is reserved for the dimensionality of our search space, ~R d , 
and we assume throughout that d > 2 is a constant. In J-dimensional space we still use the term "square" to 
refer to a <i-dimensional cube and we use "quarter" for any of the \/2 d partitions of a square r into squares 
having the center of r as a corner and sharing part of r's boundary. A square r is identified by its center c(r) 
and its half side length s(r). 

The Computational Model. As is standard practice in computational geometry algorithms dealing with 
quadtrees and octrees (e.g., see [10]), we assume in this paper that certain operations on points in R d can 
be done in constant time. In real applications, these operations are typically performed using hardware 
operations that have running times similar to operations used to compute linear intersections and perform 
point/line comparisons. Specifically, in arithmetic terms, the computations needed to perform point location 
in a quadtree, as well as update and range query operations, involve finding the most significant binary 
digit at which two coordinates of two points differ. This can be done in 0(1) machine instructions if we 
have a most-significant-bit instruction, or by using floating-point or extended-precision normalization. If 
the coordinates are not in binary fixed or floating point, such operations may also involve computing integer 
floor and ceiling functions. 

The Compressed Quadtree. As the bottom-level structure in a skip quadtree is a compressed quadtree [2, 
9-11], let us briefly review this structure. 

The compressed quadtree is defined in terms of an underlying (standard) quadtree for the same point set; 
hence, we define the compressed quadtree by identifying which squares from the standard quadtree should 
also be included in the compressed quadtree. Without loss of generality, we can assume that the center of 
the root square (containing the entire point set of interest) is the origin and the half side length for any square 
in the quadtree is a power of 2. A point x is contained in a square p iff — s(p) < x, — c(p)j < s(p) for each 
dimension i € [1, • ■ • ,d\. According to whether x, — c{p)i < or > for all dimensions we also know in 
which quarter of p that x is contained. 

Define an interesting square of a (standard) quadtree to be one that is either the root of the quadtree 
or that has two or more nonempty children. Then it is clear that any quadtree square p containing two or 
more points contains a unique largest interesting square q (which is either p itself of a descendent square 
of p in the standard quadtree). In particular, if q is the largest interesting square for p, then q is the LCA 
in the quadtree of the points contained in p. We compress the (standard) quadtree to explicitly store only 
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the interesting squares, by splicing out the non-interesting squares and deleting their empty children from 
the original quadtree. That is, for each interesting square p, we store 2 d bi-directed pointers one for each 
J-dimensional quarter of p. If the quarter contains two or more points, the pointer goes to the largest 
interesting square inside that quarter; if the quarter contains one point, the pointer goes to that point; and 
if the quarter is empty, the pointer is NULL. We call this structure a compressed quadtree [2,9-11]. (See 
Fig.Ql) ' 

i i ra i i i 



Figure 1: A quadtree containing 3 points (left) and its compressed quadtree (right). Below them are the 
pointer representations, where a square or an interesting square is represented by a square, a point by a solid 
circle and an empty quarter by a hollow circle. The 4 children of each square are ordered from left to right 
according to the I, II, III, IV quadrants. 

A compressed <i-dimensional quadtree Q for n points has size 0(n), but its worst-case height is 0(n), 
which is inefficient yet nevertheless improves the arbitrarily-bad worst-case height of a standard quadtree. 
These bounds follow immediately from the fact that there are 0(n) interesting squares, each of which has 
size 0(2 d ). 

With respect to the arithmetic operations needed when dealing with compressed quadtrees, we assume 
that we can do the following operations in 0(1) time: 

• Given a point x and a square p, decide if x € p and if yes, which quarter of p contains x. 

• Given a quarter of a square p containing two points x and y, find the largest interesting square inside 
this quarter. 

• Given a quarter of p containing an interesting square r and a point i^r, find the largest interesting 
square inside this quarter. 

A standard search in a compressed quadtree Q is to locate the quadtree square containing a given point 
x. Such a search starts from the quadtree root and follows the parent-child pointers, and returns the smallest 
interesting square p(x) in Q that covers the location of x. Note that p{x) is either a leaf node of Q or an 
internal node with none of its child nodes covering the location of x. If the quarter of p{x) covering the 
location of x contains exact one point and it matches x, then we find x in Q. Otherwise x is not in Q, but 
the smallest interesting square in Q covering the location of x is found. The search proceeds in a top-down 
fashion from the root, taking 0(1) time per level; hence, the search time is 0(n). 
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Inserting a new point starts by locating the interesting square p(x) covering x. Inserting x into an empty 
quarter of p(x) only takes 0(1) pointer changes. If the quarter of p(x) x to be inserted into already contains 
a point y or an interesting square r, we insert into Q a new interesting square q C p that contains both x and y 
(or r) but separates x and y (or r) into different quarters of q. This can be done in 0(1) time. So the insertion 
time is 0(1), given p(x). 

Deleting x may cause its covering interesting square p(x) to no longer be interesting. If this happens, we 
splice p(x) out and delete its empty children from Q. Note that the parent node of p(x) is still interesting, 
since deleting x doesn't change the number of nonempty quarters of the parent of p(x). Therefore, by 
splicing out at most one node (with 0(1) pointer changes), the compressed quadtree is updated correctly. 
So a deletion also takes 0(1) time, given p{x). 

Theorem 1 Point-location searching, as well as point insertion and deletion, in a compressed d-dimensional 
quadtree ofn points can be done in 0(n) time. 

Thus, the worst-case time for querying a compressed quadtree is no better than that of brute-force search- 
ing of an unordered set of points. Still, like a standard quadtree, a compressed quadtree is unique given a 
set of n points and a (root) bounding box, and this uniqueness allows for constant-time update operations if 
we have already identified the interesting square involved in the update. Therefore, if we could find a faster 
way to query a compressed quadtree while still allowing for fast updates, we could construct an efficient 
dynamic multidimensional data structure. 

3 The Randomized Skip Quadtree 

In this section, we describe and analyze the randomized skip quadtree data structure, which provides a hier- 
archical view of a compressed quadtree so as to allow for logarithmic expected-time querying and updating, 
while keeping the expected space bound linear. 

Randomized Skip Quadtree Definition. The randomized skip quadtree is defined by a sequence of com- 
pressed quadtrees that are respectively defined on a sequence of subsets of the input set S. In particular, we 
maintain a sequence of subsets of the input points S, such that So = S, and, for i > 0, S; is sampled from S,_i 
by keeping each point with probability 1/2. (So, with high probability, Saiogn = ®-) F° r each S;, we form 
a (unique) compressed quadtree Qj for the points in 5,. We therefore view the Qfs as forming a sequence 
of levels in the skip quadtree, such that So is the bottom level (with its compressed quadtree defined for the 
entire set S) and S top being the top level, defined as the lowest level with an empty underlying set of points. 

Note that if a square p is interesting in Qi, then it is also interesting in <2i_i. Indeed, this coherence 
property between levels in the skip quadtree is what facilitates fast searching. For each interesting square p 
in a compressed quadtree Qi, we add two pointers: one to the same square p in 0;-i and another to the same 
square p in Qi+\ if p exists in <2/+i, or NULL otherwise. The sequence of O/'s and Si's, together with these 
auxiliary pointers define the skip quadtree. (See Fig. [3) 

Search, Insertion, and Deletion in a Randomized Skip Quadtree. To find the smallest square in Oo 
covering the location of a query point x, we start with the root square pi^tan in Qi- (I is the largest value for 
which Si is nonempty and / is O(logTi) w.h.p.) Then we search x in Q\ as described in Section|2j following 
the parent-child pointers until we stop at the minimum interesting square pi^nd in Qi that covers the location 
of x. After we stop searching in each Qi we go to the copy of pt^nd in <2;-i and let Pi-i >s tart = Pi, end to 
continue searching in Qi-\. (See the searching path of y in Fig. [2) 
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Figure 2: A randomized skip quadtree consists of Qq, Qi and Q2. (Identical interesting squares in two 
adjacent compressed quadtrees are linked by a double-head arrow between the square centers.) 

Lemma 2 For any point x, the expected number of searching steps within any individual Qi is constant. 
Proof: Suppose the searching path of x in Qi from the root of Q, is po,pi,"- ,p m - (See Qo in Fig. 0) 
Consider the probability Pr{j) of Event {j) such that p m -j is the last one in po,p\,- ■ ■ ,p m which is also 
interesting in <2i+i- (Note that Event {j) and Event(j') are excluding for any j 7^ /.) Then j is the number 
of searching steps that will be performed in Qi. We overlook the case j = since it contributes nothing to 
the expected value of j. 

Since each interesting square has at least two non-empty quarters, there are at least j + 1 non-empty 
quarters hung off the subpath p m -j, • • • ,p m . Event(j) occurs only if one (with probability Pr\) or zero (with 
probability Pro) quarters among these > j + 1 quarters is still non-empty in <2i+l- Otherwise, the LCA of 
the two non-empty quarters in Qi will be interesting in <2i+i- So Pr(j) < Pr\ + Pro < &r + jm- (E-g-, 
consider p m -j = po in Fig. |2] Note that this is not a tight upper bound.) The expected value of j is then 

m m 7_|_1 1 1 m 7*2 m j 1 

E(j) = ^jPr{j) < £j(^r + ^TTi) = 2 I 27 + ^ 27 " 2 X 6 -° + 2 -° = 5 "°- d> 

Consider an example that each pj has exact two non-empty quarters, and the non-empty quarter of pj 
that does not contain pj + \ contains exact one point xj. (For the two non-empty quarters of p m , we let each 
of them contain exact one point, and choose any one as x m .) Event(j) happens iff xj is selected to Sj + \ and 
another point among the rest m — j +1 points contained in pj is also selected. So Pr(j) = \ ■ &r. (E.g., 
consider p m -j = p\ in Fig. 13) The expected value of j is then 

E(j) = LjPrU) = jj + L jj) « -(6.0 + 2.0) =2.0. (2) 

Therefore in the worst case, the expectation of j is between 2 and 5. (See the appendix for out compu- 
tation of the progressions in (1) and (2).) ■ 

To insert a point x into the structure, we perform the above point location search which finds pi^nd within 
all the <2,-'s, flip coins to find out which 5,-'s x belongs to, then for each 5; containing x, insert x into p^ en d i n 
Qi as described in Section^ Note that by flipping coins we may create one or more new non-empty subsets 
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, • • • which contains only x, and we shall consequently create the new compressed quadtrees <2/+i , " • 
containing only x and add them into our data structure. Deleting a point x is similar. We do a search first to 
find Pi^nd in all <2,'s. Then for each Qj that contains x, delete x from Pi^nd in Qi as described in Section |2j 
and remove Qi from our data structure if Qi becomes empty. 

Theorem 3 Searching, inserting or deleting any point in a randomized d -dimensional skip quadtree of n 
points takes expected 0(log«) time. 

Proof: The expected number of non-empty subsets is obviously 0(\ogn). Therefore by Lemma |2j the 
expected searching time for any point is 0(1) per level, or 0(log«) overall. For any point to be inserted or 
deleted, the expected number of non-empty subsets containing this point is obviously 0(1). Therefore the 
searching time dominates the time of insertion and deletion. ■ 

In addition, note that the expected space usage for a skip quadtree is 0(n), since the expected size of the 
compressed quadtrees in the levels of the skip quadtree forms a geometrically decreasing sum that is 0(n). 



4 The Deterministic Skip Quadtree 

In the deterministic version of the skip quadtree data structure, we again maintain a sequence of subsets S; of 
the input points 5 with So = S and build a compressed quadtree Qj for each 5,. However, in the deterministic 
case, we make each Qj an ordered tree and sample Sj from S,_i in a different way. We can order the 2 d 
quarters of each <f -dimensional square (e.g., by the I, II, III, IV quadrants in R 2 as in Fig.^or by the lexical 
order of the J-dimensional coordinates in high dimensions), and call a compressed quadtree obeying such 
order an ordered compressed quadtree. Then we build an ordered compressed quadtree Qo for So = S and 
let Lo = L be the ordered list of So in Qo from left to right. Then we make a skip list L for L with L ( being 
the i-th level of the skip list. Let S, be the subset of S that corresponds to the i-th level L, of L , and build an 
ordered compressed quadtree Qj for each Sj. Let Xj be the copy of x at level i in L and Pi(x) be the smallest 
interesting square in Qj that contains x. Then, in addition to the pointers in Sec. |3j we put a bi-directed 

pointer between Xj and pi(x) for each x € S. (See Fig.|3]) 

Qo Qi 
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Figure 3: A deterministic skip quadtree guided by a deterministic 1-2-3 skip list. 



Lemma 4 The order of Si in Qi is L,-. 
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Proof: Noting that an interesting square in <2, is also an interesting square in Q,-_i, the LCA of two points 
x and y in <2, is also a CA of them in <2i-l- Therefore the order of 5, in £?, is a subsequence of the order of 
5,_i in Qi-\. By induction, the order of S,- in Qi is L,, given that the order of So in Qo is Lq. ■ 

The skip list L is implemented as a deterministic 1-2-3 skip list in [25], which maintains the property 
that between any two adjacent columns at level i there are 1, 2 or 3 columns of height i — 1 (Fig.13}. There 
are O(logrc) levels of the 1-2-3 skip list, so searching takes 0(log?i) time. Insertion and deletion can be 
done by a search plus 0(1) promotions or demotions at each level along the searching path. 

We also binarize Qo by adding d — 1 levels of dummy nodes, one level per dimension, between each 
interesting square and its 2 d quarters. Then we independently maintain a total order (the in-order in the 
binary Qo) for the set of interesting squares, dummy nodes and points in Qo- The order is maintained as 
in [13] which supports the following operations: 1) insert x before or after some y; 2) delete x; and 3) 
compare the order of two arbitrary x and y. All operations can be done in deterministic worst case constant 
time. These operations give a total order out from a linked list, which is necessary for us to search in L . 
Because of the binarization of Qo and the inclusion of all internal nodes of the binarized Qo in our total 
order, when we insert a point x into Qo, we get a y (parent of x in the binary tree) before or after the insertion 
point of x, so that we can accordingly insert x into our total order. 

In the full version, we give details for insertion and deletion in a deterministic skip quadtree (searching 
is the same as in the randomized version), proving the following: 

Theorem 5 Search, insertion and deletion in a deterministic d -dimensional skip quadtree of n points take 
worst case 0{\ogn) time. 

Likewise, the space complexity of a deterministic skip quadtree is 0{n). 

5 Approximate Range Queries 

In this section, we describe how to use a skip quadtree to perform approximate range queries, which are 
directed at reporting the points in S that belong to a query region (which can be an arbitrary convex shape 
having 0(1) description complexity). For simplicity of expression, however, we assume here that the query 
region is a hyper-sphere. We describe in the full version how to extend our approach to arbitrary convex 
ranges with constant description complexity. We use x, y for points in the data set and u, v for arbitrary points 
(locations) in K d . An approximate range query with error £ > is a triple (v, r,s) that reports all points x 
with d(v,x) < r but also some arbitrary points y with r < d(v,y) < (1 +s)r. That is, the query region R 
is a (hyper-) sphere with center v and radius r, and the permissible error range A is a (hyper-) annulus of 
thickness er around R. 

Suppose we have a space partition tree T with depth Dj where each tree node is associated with a region 
in R d . Given a query (v, r, e) with region R and annulus A, we call a node p G T an in, out, or stabbing node 
if the R (/ region associated with p is contained in R U A, has no intersection with R, or intersects both R and 
R(JA. Let S be the set of stabbing nodes in T whose child nodes are not stabbing. Then the query can be 
answered in 0(\S\Dj +k) time, with k being the output size. Previously studied space partition trees, such 
as BBD trees [4-6] and BAR trees [14, 16], have an upper-bound on \S\ of 0{z l ~ d ) and Dj of 0(log?i), 
which is optimal [5]. The ratio of the unit volume of A, (zr) d ' , to the lower bound of volume of A that is 
covered by any stabbing node is called the packing function p(n) of T, and is often used to bound \S\. A 
constant p(n) immediately results in the optimal |5| = 0(e ) for convex query regions, in which case the 
total volume of A is 0(8?^). 

Next we'll show that a skip quadtree data structure answers an approximate range query in 0(e l ~ d logn + 
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k) time. This matches the previous results (e.g., in [14, 16]), but skip quadtrees are fully dynamic and sig- 
nificantly simpler than BBD trees and BAR trees. 

Query Algorithm and Analysis. Given a skip quadtree Qo,Q\, - ■ ■ ,Qi and an approximate range query 
(v, r, e) with region R and annulus A, we define a critical square p G Qi as a stabbing node of Qj whose child 
nodes are either not stabbing, or still stabbing but cover less volume of R than p does. If we know the set 
C = Co of critical squares in Qq, then we can answer the query by simply reporting, for each p EC, the 
points inside every in-node child square of p in <2o- We now show that the size of C is 0(e l ~ d ) (due to the 
obvious constant p of quadtrees). 

Lemma 6 The number of critical squares in Qq is 0{z l ~ d ). 

Proof: Consider the inclusion tree T for the critical squares in C (that is, square p is an ancestor of square 
q in T iff p □ q ). We call a critical square a branching node if it has at least two children in T, or a non- 
branching node otherwise. A non-branching node either is a leaf of T, or covers more volume of R than 
its only child node in T does, by the definition of critical squares. Note that if two quadtree squares cover 
different areas (not necessarily disjoint) of R, then they must cover different areas of A. Therefore for each 
non-branching node p € T, there is a unique area of A covered by p but not by any other non-branching 
nodes of T. The volume of this area is clearly of the unit volume (er) d since p is a hypercube. Thus 
the total number of non-branching nodes in T is 0(e l ~ d ) since the total volume of A is 0(er d ). So |C| = \T\ 
is also 0(z l ~ d ). ■ 

Next we complete our approximate range query algorithm by showing how to find all critical squares in 
each Qj. The critical squares in Qj actually partition the stabbing nodes of Qi into equivalence classes such 
that the stabbing nodes in the same class cover the same area of R and A Each class corresponds to a path 
(could be a single node) in Qj with the tail (lowest) being a critical square and the head (if not the root of 
Qi) being the child of a critical square. For each such path we color the head red and the tail green (a node 
could be double colored). The set of green nodes (critical squares) is Q. 

Assume we have the above coloring done in <2i+l- We copy the colors to Qj and call the corresponding 
nodes p G Qj initially green or initially red accordingly. Then from each initially green node we DFS search 
down the subtree of Qi for critical squares. In addition to turning back at each stabbing node with no stabbing 
child, we also turning back at each initially red node. During the search, we color (newly) green or red to the 
nodes we find according to how the colors are defined above. After we've done the search from all initially 
green nodes, we erase the initial colors and keep only the new ones. 

Lemma 7 The above algorithm correctly finds the set Cj of all critical squares in Qi in 0{\Ci + \ |) time, so 
that finds C = Co in 0(\C\ logn) time, which is the expected running time for randomized skip quadtrees or 
the worst case running time for deterministic skip quadtrees. 

Proof: Correctness. If a stabbing node p has its closest initial red ancestor p' lower than its closest initial 
green ancestor p", then it will be missed since when searching from p", p will be blocked by p'. Let p' and 
p" be a pair of initially red and green nodes which correspond to the head and tail of a path of equivalent 
stabbing squares in <2, + i, and p be a stabbing square in Qi that is a descendant of p' . Note that, if p covers 
less area of R than p' and p" do, then p is also a descendant of p"; otherwise if p covers the same area of 
R as p' and p" do but p is not a descendant of p" , then p is not critical because p" is now contained in p. 
Therefore we won't miss any critical squares. 

Running time. By the same arguments as in Lemma |2j the DFS search downward each initially green 
node p has constant depth, because within constant steps we'll meet a square q which is a child node of p in 
Qi+i. q is either not stabbing or colored initially red so we'll go back. ■ 

Following Lemma loTTl and the algorithm description, we immediately get 
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Theorem 8 We can answer an approximate range query (v,r,e) in 0(e l ~ d \ogn + k) time with k being the 
output size, which is the expected running time for randomized skip quadtrees or the worst case running 
time for deterministic skip quadtrees. 

References 

[1] S. Alum. Quadtrees and octrees. In D. P. Mehta and S. Sahni, editors, Handbook of Data Structures and Applications, pages 
19-1-19-26. Chapman & Hall/CRC, 2005. 

[2] S. Alum and F. E. Sevilgen. Dynamic compressed hyperoctrees with application to the N-body problem. In Proc. 19th Conf. 
Found. Softw. Tech. Theoret. Comput. Sci, volume 1738 of Lecture Notes Comput. Sci., pages 21-33. Springer- Verlag, 1999. 

[3] S. Arya and D. Mount. Computational geometry: Proximity and location. In D. P. Mehta and S. Sahni, editors, Handbook of 
Data Structures and Applications, pages 63-1-63-22. Chapman & Hall/CRC, 2005. 

[4] S. Arya and D. M. Mount. Approximate nearest neighbor queries in fixed dimensions. In Proc. 4th ACM-S1AM Sympos. 
Discrete Algorithms, pages 271-280, 1993. 

[5] S. Arya and D. M. Mount. Approximate range searching. Comput. Geom. Theory Appl., 17:135-152, 2000. 

[6] S. Arya, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Wu. An optimal algorithm for approximate nearest neighbor 
searching in fixed dimensions. J. ACM, 45:891-923, 1998. 

[7] T. Asano, M. Edahiro, H. Imai, M. Iri, and K. Murota. Practical use of bucketing techniques in computational geometry. In 
G. T. Toussaint, editor, Computational Geometry, pages 153-195. North-Holland, Amsterdam, Netherlands, 1985. 

[8] J. L. Bentley. Multidimensional binary search trees used for associative searching. Commun. ACM, 18(9):509-517, Sept. 
1975. 

[9] M. Bern. Approximate closest-point queries in high dimensions. Inform. Process. Lett., 45:95-99, 1993. 

[10] M. Bern, D. Eppstein, and S.-H. Teng. Parallel constmction of quadtrees and quality triangulations. In Proc. 3rd Workshop 
Algorithms Data Struct., volume 709 of Lecture Notes Comput. Sci., pages 188-199. Springer- Verlag, 1993. 

[11] K. L. Clarkson. Fast algorithms for the all nearest neighbors problem. In Proc. 24th Annu. IEEE Sympos. Found. Comput. 
Sci., pages 226-232, 1983. 

[12] M. de Berg, M. van Kreveld, M. Overmars, and O. Schwarzkopf. Computational Geometry: Algorithms and Applications. 
Springer- Verlag, Berlin, Germany, 2nd edition, 2000. 

[13] P. F. Dietz and D. D. Sleator. Two algorithms for maintaining order in a list. In Proc. 9th ACM STOC, pages 365-372, 1987. 

[14] C. A. Duncan. Balanced Aspect Ratio Trees. Ph.D. thesis, Department of Computer Science, Johns Hopkins University, 
Baltimore, Maryland, 1999. 

[15] C. A. Duncan and M. T. Goodrich. Approximate geometric query structures. In D. P. Mehta and S. Sahni, editors, Handbook 
of Data Structures and Applications , pages 26-1-26-17. Chapman & Hall/CRC, 2005. 

[16] C. A. Duncan, M. T. Goodrich, and S. Kobourov. Balanced aspect ratio trees: combining the advantages of k-d trees and 
octrees. /. Algorithms, 38:303-333, 2001. 

[17] A. Efrat, M. J. Katz, F. Nielsen, and M. Sharir. Dynamic data stmctures for fat objects and their applications. Comput. Geom. 
Theory Appl, 15:215-227, 2000. 

[18] A. Efrat, G. Rote, and M. Sharir. On the union of fat wedges and separating a collection of segments by a line. Comput. 
Geom. Theory Appl, 3:277-288, 1993. 

[19] K. Fujimura, H. Toriya, K. Tamaguchi, and T. L. Kunii. Octree algorithms for solid modeling. In Proc. Intergraphics '83, 
volume B2-1, pages 1-15, 1983. 

[20] E. N. Hanson and T. Johnson. The interval skip list: A data stmcture for finding all intervals that overlap a point. In Workshop 
on Algorithms and Data Structures (WADS), pages 153-164, 1991. 

[21] D. T. Lee. Interval, segment, range, and priority search trees. In D. P. Mehta and S. Sahni, editors, Handbook of Data 
Structures and Applications, pages 18-1-18-21. Chapman & Hall/CRC, 2005. 

[22] S. Leutenegger and M. A. Lopez. R-trees. In D. P. Mehta and S. Sahni, editors, Handbook of Data Structures and Applications, 
pages 21-1-21-23. Chapman & Hall/CRC, 2005. 

[23] M. A. Lopez and B. G. Nickerson. Analysis of half-space range search using the k-d search skip list. In 14th Canadian 
Conference on Computational Geometry, pages 58-62, 2002. 



10 



[24] J. I. Munro, T. Papadakis, and R. Sedgewick. Deterministic skip lists. In Proc. Annual ACM-S1AM Symposium on Discrete 
Algorithms (SODA), pages 367-375, 1992. 

[25] J. I. Munro, T. Papadakis, and R. Sedgewick. Deterministic skip lists. In Proceedings of the third annual ACM-SIAM 
symposium on Discrete algorithms (SODA), pages 367 - 375, 1992. 

[26] B. F. Naylor. Binary space partitioning trees. In D. P. Mehta and S. Sahni, editors, Handbook of Data Structures and 
Applications, pages 20-1-20-19. Chapman & Hall/CRC, 2005. 

[27] B. G. Nickerson. Skip list data structures for multidimensional data. Technical Report CS-TR-3262, 1994. 

[28] J. Nievergelt and P. Widmayer. Spatial data structures: Concepts and design choices. In J.-R. Sack and J. Urrutia, editors, 
Handbook of Computational Geometry, pages 725-764. Elsevier Science Publishers B.V. North-Holland, Amsterdam, 2000. 

[29] J. A. Orenstein. Multidimensional tries used for associative searching. Inform. Process. Lett., 13:150-157, 1982. 

[30] W. Pugh. Skip lists: a probabilistic alternative to balanced trees. Commun. ACM, 33(6):668-676, 1990. 

[31] H. Samet. Spatial Data Structures: Quadtrees, Octrees, and Other Hierarchical Methods. Addison- Wesley, Reading, MA, 
1989. 

[32] H. Samet. Applications of Spatial Data Structures: Computer Graphics, Image Processing, and GIS. Addison- Wesley, 
Reading, MA, 1990. 

[33] H. Samet. Spatial data structures. In W. Kim, editor, Modern Database Systems, The Object Model, Interoperability and 
Beyond, pages 361-385. ACM Press and Addison- Wesley, 1995. 

[34] H. Samet. Multidimensional data structures. In M. J. Atallah, editor, Algorithms and Theory of Computation Handbook, 
pages 18-1-18-28. CRC Press, 1999. 

[35] H. Samet. Multidimensional spatial data structures. In D. P. Mehta and S. Sahni, editors, Handbook of Data Structures and 
Applications, pages 16-1-16-29. Chapman & Hall/CRC, 2005. 



11 



A Appendix 

If f(x) > is a monotone decreasing function for x > i, then the progression of f(x) can be approximated 
by its integral as following: 

i — 1 p co °° i poo 

£ /(*) + / f(x)dx < £ fix) < £ f(x) + / f(x)dx. 
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